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Abstract 

We search for the purely leptonic decay — > pL^v^ in the full BABAR dataset, having an integrated 
luminosity of approximately 426 fb~^. We adopt a fully inclusive approach, where the signal 
candidate is identified by the highest momentum lepton in the event and the companion B is 
inclusively reconstructed without trying to identify its decay products. We set a preliminary upper 
limit on the branching fraction of B{B^ — > ^i^v^) < 1.3 x 10~^ at the 90% confidence level, using 
a Bayesian approach. 
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1 INTRODUCTION 



In the Standard Model (SM), the purely leptonic B decays i~^i^e { i = e, fi,T ) (charge 

conjugation is implied troughout the paper) proceed through the annihilation of the two quarks in 
the meson to form a virtual W boson (Fig. [1]). The branching ratio can be cleanly calculated in 
the SM, 

Stt V m| / 

where Gp is the Fermi coupling constant, nii and tub are the lepton and B meson masses, and tb 
is the -B"*" lifetime. The decay rate is sensitive to the Cabibbo Kobayashi Maskawa matrix element 

and the B decay constant fs which describes the overlap of the quark wave functions within 
the meson. Currently, the uncertainty on fs is one of the main factors limiting the determination 
of Vtd from precision B^B^ mixing measurements. Given a measurement of Vub from semileptonic 
decays such as B ^ Triiy, fs could be extracted from a measurement of the B^ — s- i'^i'i branching 
ratio. 

The SM estimate of the branching ratio for B~^ — > t'^Uj- is (1.59 it 0.40) x 10~^ assuming tb 
= 1.638ib0.011 ps, Vub = (4.39ib0.33)xl0~^[l] determined from inclusive charmless semileptonic 
B decays and fs = 216±22 MeV [2j from lattice QCD calculation. Due to helicity suppression, 
B~^ — > and B^ e^Ve are suppressed by factors of 225 and 10^ respectively, leading to 

branching ratios of B{B^ fJ-'^'^iJ,) — 4.7 x 10~^ and B{B'^ e^Ve) ^ 1.1 x 10~^^. 




Figure 1: SM annihilation diagram for B^ — > l^u^. 



Purely leptonic B decays are sensitive to physics beyond the SM due to possible insertion of New 
Physics (NP) heavy states in the annihilation process. Charged Higgs boson effects may greatly 
enhance or suppress the branching ratio in certain two Higgs doublet models Similarly, this 
decay may be enhanced through mediation by leptoquarks in the Pati-Salam model of quark-lepton 
unification [3]. 

Moreover, as in annihilation processes the longitudinal component of the vector boson is directly 
involved, this decay allows a direct test of Yukawa interactions in and beyond the SM. In particular, 
in a SUSY scenario at large tan/3 [0{mt/mb) » 1), non-standard effects in helicity-suppressed 
charged current interactions are potentially observable, being strongly tan/3 dependent: 

B{B^ ^ l^jyi) ^ B{B^ ^ l^ui)sM X (l - tanP^ml/M^jf. (2) 

Recently, Belle had a first evidence of a purely leptonic B decay. With 414 fb^^. Belle finds [5] 

j3(B+ ^ r+u,) = (l.79^0i6(stat)10;t?(syst)) x 10-^ (3) 
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at 3.5 (T significance. The most recent BABAR result on this channel uses an integrated luminosity 
of 346 fb~^, corresponding to 383 million of BB pairs, and sets an upper limit (UL) at 90% of 
confidence level on the branching ratio of B{B'^ t^i^t) < 1.7 x 10~^[6j and a central value 
B{B+ T+Ur) = (1.2 ± 0.4(stat) ± 0.3(bkg) ± 0.2(syst)) x IQ-"^^. 

BABAR has published a result on B{B^ — > fJ.'^v^) with 81 fb~^ and set an UL at 90 % confidence 
level of B{B~^ — > ^"""f^) < 6.6 x 10~^ [8]. The current best published upper limits at 90 % confidence 
level on B^ — > /i^t'^ and B^ — s- e'^Ve are from Belle Collaboration on 235 fb~^ [9j 

B{B+ fi+Uf,) < 1.7 X 10-^ 

B{B+ e+i^e) < 9.8 X 10~^ (4) 



2 THE BABAR DETECTOR AND DATASET 

This analysis is based on the data collected with the BABAR detector pO] at the PEP-II storage 
ring. The sample corresponds to an integrated luminosity of 426 fb~^ at the T{AS) resonance, 
consisting of about 447 millions of BB pairs, and 44fb~^ accumulated at a center-of-mass (CM) 
energy about 40 MeV below the T(4S') resonance. Off-resonance data are used as cross-checks for 
continuum qq {q = u, d, s, and ) and t~^t~ on-resonance events. In particular, given the variation 
of muon identification in time due to detector differences and changes, we considered the total 
dataset divided into data-taking periods (runs). 

Charged track reconstruction is provided by a Silicon Vertex Tracker (SVT) and a Drift Cham- 
ber (DCH) operating in a 1.5-T magnetic field. Particle identification is based on the energy loss 
dE/dx in the tracking system and the Cherenkov angle in an internally reflecting ring-imaging 
Cherenkov detector. Photon detection is provided by a CsI(Tl) Electromagnetic Calorimeter 
(EMC). Muons and neutral hadrons are identified by Resistive Plate Chambers and Limited 
Streamer Tubes in the Instrumented Flux Return (IFR) detector. 

A GEANT4-based [JJJ Monte Carlo (MC) simulation is used to model the detector response 
and test the analysis technique. A sample of about 28 million simulated B'^B~ events where -B"*" 
decays to /i^t'^ and the B~ decays generically is studied to evaluate the efficiency for the signal. 
Background sources considered include e~^e~ — > BB, e~^e~ qq {q = u, d, s, and ), and e~^e~ — > 
r+r~ in quantities comparable to three times (BB), twice (cc) and once {uds,T~^T~) the actual 
dataset luminosity. 



3 ANALYSIS METHOD 

B^ — >■ //"''f/i is a two-body decay so the muon must be mono-energetic in the B rest frame. The 
momentum p* of the muon in the B rest frame is given by 

p* = — ^ ^ 2.46GeV. (5 

where is the B mass and is the muon mass. At BABAR the CM frame is a good approximation 
to the B rest frame, so we initially select well-identified muon candidates with momentum pcM 
between 2.4 and 3.2 GeV/c in the CM frame. Since the neutrino produced in the signal decay is not 
detected, any other charged tracks or neutral deposits in a signal event must have been produced by 
the decay of the companion (tag) B. Therefore, the tag B can be reconstructed from the remaining 
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visible energy in the event. Signal decays can then be selected using the kinematic variables AE 
and energy-substituted mass, msS) defined by 

AE' = Eb — -Ebeam, (6) 

and 




where pB and Eb are the momentum and energy of the reconstructed tag B candidate in the CM 
frame and -Ebeam is the beam energy in the CM frame. We include all neutral calorimeter clusters 
with cluster energy greater than 30 MeV. Particle identification is applied to the charged tracks to 
identify electrons, muons, kaons and protons in order to apply the most likely mass hypothesis to 
each track and thus improve the AE and ttt-es resolution. Events with additional identified leptons 
are discarded to discriminate against events containing additional neutrinos. For signal events in 
which all decay products of the other B are reconstructed, we expect the AE distribution to peak 
near zero and mss to peak near m^. In reality, due to the inclusive nature of our analysis, we 
often fail to reconstruct all the decay products so that the AE distribution develops a negative 
tail while the niEs distribution exhibits a tail below the B mass. For uds and cc backgrounds, 
AE is shifted significantly greater than zero since we attribute too much energy to the opposite 
hemisphere decay. AE is negative for r+r^ decays due to missing neutrinos. Figure [2] shows the 
distributions of AE and mss for the on-peak data, background MC and signal MC after muon 
candidate selection. 




Figure 2: Distribution of AE and niEs after the muon selection: signal in blue histogram, data in 
black dots and background events are stacked on top of each other: uds in red, cc in yellow, t'^t~ 
in green, B^B^ in dark blue and B^B^ in light blue. The arrows indicate the requirements set on 
these variables. 

In order to obtain data/MC agreement, we extract the uds, cc and t~^t~ MC normalization 
coefficients from a fit to the AE data distribution, keeping the bb component fixed. The require- 
ments on tag B kinematical variables is optimized with the figure of merit Csig/ V-^bkg where egig 
is the signal efficiency and A'^bkg the number of background events. We require the tag B AE and 
ruES to be within -2.25 < AE < GeV and tues > 5.246 GeV/c^. 
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Once the tag B is reconstructed, we refine the estimate of the muon momentum in the B rest 
frame (p*)- We use the momentum direction of the tag B and assume a total momentum of 320 
MeV/c in the CM frame (from the decay of the T(45) BB) to boost the muon candidate into 
the reconstructed B rest frame. 

Backgrounds may arise from any process producing charged tracks in the momentum range of 
the signal, particularly if the charged tracks are muons. The two most significant backgrounds are 
B semileptonic decays involving b ujjLUfj^ transitions where the endpoint of the muon spectrum 
approaches that of the signal, and non-resonant qq (continuum) events where a charged pion is mis- 
takenly identified as a muon. In the continuum events, there must also be significant missing energy 
due to detector acceptance, neutral hadrons, or additional neutrinos that mimic the signature of 
the expected neutrino. 

Continuum backgrounds are suppressed using event shape variables. The light-quark events 
tend to produce a jet-like event topology as opposed to BB events which tend to be more isotrop- 
ically distributed in space. Several topological variables have been considered and five have been 
found to be the most discriminating, using an appropriate cocktail of different data-taking periods. 
These variables are combined in a Fisher discriminant [12j : the normalized second Fox- Wolfram 
moment R2 [13) . calculated using all charged tracks and neutral clusters in the event; the ratio of 
the second and zeroth Legendre Polynomials L2/LQ, where all tag B daughters momenta in the 
CM frame are included and the angle is measured with respect to the lepton candidate momentum; 
the cosine of the angle of the expected signal neutrino in the lab frame (as determined from the 
lepton candidate); the lepton transverse momentum in T(4S') frame; the sphericity of the event. 
The Fisher coefficients are optimized run-by-run. A cut is applied on the Fisher discriminant and 
is thus optimized for each run separately in order to have better performance. The efficiency of 
this cut is in the range 16% to 32% for signal events , 5% to 16% for hh events and less than 0.5% 
for continuum events. 

The two-body kinematics of this decay is now exploited by combining p* and pcM in a second 
Fisher discriminant in order to discriminate against the remaining semileptonic bh background 
events. Signal and background yields are obtained from a Maximuum Likelihood Fit using the 
Fisher output pfit- We parameterize signal MC with the sum of two Gaussians. As bb events 
and continuum qq and t~^t~ events are two background samples with different p-pn Probability 
Density Functions (PDFs), we parameterize them separately and construct a summed background 
PDF with relative normalizations fixed from simulated events. Both of them are parameterized 
with a Gaussian function with a different sigma for value above and below the peak (bifurcated 
Gaussians). Table [T] shows the fixed parameterization of signal and backgrounds PDFs, with 
purely statistical uncertainties arising from the size of the simulated datasets used to obtain the 
parameterizations. The p-piT distributions for simulated signal and background events are shown 
in Figure [3l Only the signal yield and the yield of the sum of all backgrounds are free parameters 
in the fit. 

4 SYSTEMATIC STUDIES 

To set an upper limit on the B~^ — > jJL^i'^ branching fraction we evaluate systematic uncertainties 
in the number of B^ in the sample, the signal efficiency and the signal yield. 

• The number of B^ mesons in the on-peak data sample is estimated to be 447 x 10^ with an 
uncertainty of 1.1% estimated studying /i^u pairs events |l4j . 
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Table 1: pfit distribution parameterization for signal MC (left), bb (center) and uds + cc + r+r" 
(right). The two summed Gaussians have parameters (/icorejCcore) and (/^taiijCtaii) respectively and 
/core is the relative fraction of the core Gaussian. The bifurcated Gaussian has /Ucore as mean and 
aL and aR as left and right a respectively. 



Parameter 


Signal MC 


bb 


uds + cc + r^T 


A'core 


5.32 ± 0.03 


-2.2 ± 0.3 


0.5 ± 1.1 


'-'^core(L) 


2.43 ± 0.04 


0.7 ± 0.2 


1.6 ± 0.7 




4.91 ± 0.03 






C^taiUR) 


1.39 ± 0.04 


3.6 ± 0.3 


6.8 ± 0.9 


/core 


0.54 ± 0.03 







• The uncertainty in the signal efficiency includes the muon candidate selection (particle iden- 
tification, tracking efficiency and Fisher requirement) as well as the reconstruction efficiency 
of the tag B. The muon identification efficiency systematic is evaluated using control samples 
derived from the BABAR data, which are weighted to reproduce the kinematic distribution of 
the muon signal candidate. Comparing the cumulative signal efficiency obtained with and 
without this weight, a total discrepancy of 2.9% is found and this value is taken as the muon 
ID systematic uncertainty. 

Charge conservation is imposed on r decays, which must proceed via an odd number of tracks 
and thus the number of events with a missing track can be used to evaluate the uncertainty 
associated with the tracking efficiency and the relative correction factor. The systematic 
uncertainty per track and the correction factor are taken in quadrature to give the total 
tracking efficiency uncertainty of 0.4% per track. 

In order to evaluate the systematic uncertainty associated with the requirements on the Fisher 
discriminants, we take the ratio between data and simulated events Fisher discriminant distri- 
butions in the AE and niES sidebands AE > GeV and 5.2 GeV/c^ < niES < 5.246 GeV/c^ 
for each different data-taking period. We fit the data/MC ratio for each run with a linear 
function. The mean weighted by the errors of slopes and intercepts returns a linear function 
consistent with a data/MC unitary ratio in the full Fisher range. We take the uncertainty of 
1.5% on the averaged intercept as the systematic error on the Fisher discriminant cut. 

The tag B reconstruction has been studied with a control sample of B~^ — > D^*^'^7r^ events, 
where the D is reconstructed into K^tt^ and K^tt^ and the D* into D*^ D^j 

or D*^ — > D^n^. This is also a two-body decay so it is topologically very similar to our signal. 
Once reconstructed, the pion can be treated as if it were the signal muon and the D^*^^ decay 
products are ignored to simulate the neutrino. The tag B is then reconstructed in the control 
sample as it would be for signal. We compare the efficiencies for our tag B selection cuts in 
the B^ — D^*^^Tr~^ data and MC to quantify any data/MC disagreements that may affect 
the signal efficiency. We find a data/MC discrepancy on B^ D^*^'^7t~^ control sample of 
2.7% and assign this as the signal efficiency uncertainty arising from the tag B selection. A 
summary of the systematic uncertainties in the signal efficiency is given in Table [2j The final 
signal efficiency is thus 4.64 ± 0.19 %. 

• The fit parameters are extracted from MC and are kept fixed in the final fit to extract the 
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Figure 3: Distribution of pfit with the fit superimposed: signal (top left), bb (top right) and cc + 
uds + T+r" (bottom). 



Table 2: Contributions to the systematic uncertainty on the signal efficiency. 



Source 


Relative Error 


muon identification 


2.9% 


tracking efficiency 


0.4% 


tag B reconstruction 


2.7% 


Fisher selection 


1.5% 


Total 


4.2% 



yields. These parameters are affected by an uncertainty due to the MC statistics, which is 
considered as a source of systematic uncertainty. In order to evaluate it, the final fit has been 
repeated 500 times for each background and signal PDF parameter. We randomly generate 
the PDF parameters assuming Gaussian errors and taking into account all the correlations 
between them. We perform a Gaussian fit to the distribution of the number of signal events for 
each parameter, take the fitted sigma as the systematic uncertainty and sum in quadrature. 
The total systematic uncertainty in the signal yield from all signal and background PDFs 



13 



parameterization is 13 events. 

We take into account possible discrepancies in the shape of the Pfit background distribution 
in data and simulated events using again the simulated events over data ratio in the IS.E 
and niEs sidebands /S.E > GeV and 5.2< tues < 5.246 GeV/c^. Given the low statistics 
available for values of pfit above 5, we parameterize this ratio with a parabolic function in 
the region (-5;5) and take the constant value of the ratio in the region (5;20). The ratio with 
the parabolic fit superimposed is shown in Figure [H We repeat the final fit to the data 500 
times for each parabola parameter and the value of the constant. We generate each parameter 
according to a Gaussian distribution centered at its mean value and having a sigma equal to 
its error, taking into account all correlations between different parameters. We weight the 
dataset by the generated ratio and repeat the fit. A Gaussian fit to the distribution of number 
of fitted signal events for each parameter is performed and the sigma of the the Gaussian fit is 
taken as systematic uncertainty and summed in quarature. The total systematic uncertainty 
from this procedure is 7 events. A summary of all systematic uncertainties in the fitted signal 
yield is provided in Table [3l 

The total systematic uncertainty is 1.1% on the number of B^B^ pairs, 4.2 % on signal efficiency 
and 15 events on the signal yield. 




Figure 4: MC/Data ratio of pfit distribution on /S.E and niEs sideband (Ai? > GeV and 5.2 
GeV/c2 < niES <5.246 GeV/c^). 



Table 3: Contributions to the systematic uncertainty on the signal yield. 



Source 


Total Error 


PDF parameters 


13 


Data/MC agreement 


7 


Total 


15 
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5 RESULTS 



From the data we extract -12 it 15 signal events and 600 it 29 background events. We expect 10 
events from MC assuming a SM branching fraction B{B^ Z^^^^^j) = 4.7 x 10"'^. The signal yield 
extracted corresponds to a central value B{B^ — > /i^f^) = (—5.7 ± 7.1(stat) ± 6.8(syst)) x 10"'^. 
Figure [5] shows the data points with the final fit superimposed. 

Given the number of fitted signal events, the signal efficiency and including all systematic 
uncertainties, we find the Bayesian UL assuming a flat prior for the branching fraction up to a 
maximum of B{B^ A'^^m) = 3 x 10~^ to be 

B{B+ /i+i/^) < 1.3 X 10"^ 

at the 90% confidence level. The 95% Bayesian UL is ^i+v^) < 1.6 x lO"*'. These results 

are more restrictive than previous measurements from BABAR [8| and Belle |9]. 



IT) 

d 50 

> 

30 
20 
10 


-5 5 15 n '^^ 

"fit 

Figure 5: Final fit to the data pfit distribution: full blue line is the total distribution, dashed red 
line is background distribution, dashed-dotted black line is signal distribution. 
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